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Abstract 

The problem of parameter estimation by the observations of the 
two-state telegraph process in the presence of white Gaussian noise is 
considered. The properties of estimator of the method of moments are 
described in the asymptotics of large samples. Then this estimator 
is used as preliminary one to construct the one-step MLE-process, 
which provides the asymptotically normal and asymptotically efficient 
estimation of the unknown parameters. 
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1 Introduction 

This work is devoted to the problem of parameter estimation by the observa¬ 
tions in continuous time X'^ = {X (t), 0 < t < T) of the following stochastic 
process 

dXt = Y{t)dt + dWt, Xo, (1) 

here hUt, 0 < t < T is a standard Wiener process, X (0) = Xq is the initial 
value independent of the Wiener process and Y (f) ,0 < t < T is a two-state 
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{Ui and 1 / 2 ) stationary Markov process with transition rate matrix 

-A A 

/i —jjL 

We snppose that the values A > 0 and /r > 0 are unknown and we have to 
estimate the two-dimensional parameter d = (A,/i) G 0, here 0 = (co,ci) x 
(co, Cl) by the observations X*, 0 < t < T, i.e., the estimator 'd*j.,0<t<T 
is stochastic process. Here cq < ci are positive constants. 

Therefore, our goal is to construct an on-line estimator-process = 
*rp,0 < t < T), which can be sufficiently easy to evaluate and is asymp¬ 
totically optimal in some sense, as T —)■ 00 . This estimator-process we con- 
structe in two steps. First we introduce a learning interval [0,T'^], here 
6 E (|, 1) and propose a T^-consistent preliminary estimator constructed of 
the method of moments. Then we improve it up to asymptotically efficient 
one with the help of slightly modified one-step MLE procedure. 

Such model of observations is called “Hidden Markov Modef’ (HMM) or 
partially observed system. There exists an extensive study of such type HMM 
for discrete time models of observations, see, for example, la. n, 121 and the 
references therein. For continuous time observation models this problem is 
not so-well studied. See, for example, Elliot et ah [5], here the Part HI “Con¬ 
tinuous time estimation” is devoted to such models of observations. One can 
find there the discussion of the problems of filtration and parameter estima¬ 
tion in the case of observations of finite-state Markov process in the presence 
of White noise. The problem most close to our statement was studied by Chi- 
gansky |3], who considered the parameter estimation in the case of hidden 
finite-state Markov process by continuous time observations. He showed the 
consistency, asymptotic normality and asymptotic efficiency of the MLE of 
one-dimensional parameter. The case of two-state hidden telegraph process 
studied in our work was presented there as example but there supposed that 
X = fi. The problem of parameter estimation for similar models including 
the identification of partially observed linear processes (models of Kalman 
hltration) were studied by many authors. Let us mention here jTU], |1], |17) . 
m- The problem of asymptotically efficient estimation for the model of 
telegraph process observed in discrete times was studied in [6]. 

The proposed here one-step MLE-process is motivated by the work of 
Kamatani and Uchida jS] who introduced Newton-Raphson multi-step esti¬ 
mators in the problem of parameter estimation by discrete time observations 
of diffusion process. In particular, it was shown that multi-step Newton- 
Raphson procedure allows to improve the rate of convergence of preliminary 
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estimator up to asymptotically efficient one. Note that preliminary estima¬ 
tor there is constructted by all observations and they studied estimator of 
the unknown parameter. Applied in this work estimator-process uses the 
preliminary estimator constructed by the observations on the initial learning 
interval and follows the similar construction as that one introduced in the 
work [13] (see as well [12] )• 


2 Problem statement and auxiliary results 

We start with description of the MLE for this model of observations. The 
stochastic process CQi according to innovation theorem (see mi, Theorem 
7.12) admits the representation 

dAi = m {t, ■&) df -|- dlW, Aq, 0 < f < T, 

here m {t, ■&) is the conditional expectation 

m {t, ^9) = [A it) I A/"] = 2 /iP^ (A (f) = (A (f) = . 

Here is the cx-algebra generated by the observations up to time t, i.e., 
:= cr (Aj, 0 < s < f) and H7, 0 < f < T is an innovation Wiener process. 

Let us denote 

TT (f, = P, (A {t) = y,\F^) , P^ (A (f) = y^\F^) = 1 - vr (f, ^). 

Hence 

m (f, '&) =y 2 + {yi - 2 / 2 ) vr (f, -d). 

The random process vr (f, -d), 0 < f < T satishes the following equation (see 
mi, Theorem 9.1 and equation (9.23) there) 

dvr (f, 'd) = [p, — (A -|- p) TT (f, d) 

+7r (f, -d) (1 - vr (f, d)) ( 2/2 - 2/i) ( 2/2 + ( 2/1 - 2 / 2 ) tt (f, 'd))] df 
+ TT (f, 'd) (1 - TT (f, 'd)) ( 2/1 - 2 / 2 ) dA*. (2) 

Denote by G ©j the measures induced by the observations A* = 

(As, 0 < s < f) of stochastic processes ([T]) with different d in the space of re¬ 
alizations C [0,f] (continuous on [0,2:] functions). These measures are equiv¬ 
alent and the likelihood ratio function 

dp(0 

L (^9, AA = - ^ (A*) , ^9 G 0, 0 < 2 < T 

dPk^ 
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can be written as follows 


L (^9, X*) = exp I y m{s,'&)dXs — - J m{s,'&)ds 

Here Pq^ is the measure corresponding to with Y (s) = 0. 
The MLE-process 'dt,T is dehned by the equation 

L X^) = sup L(^,X^), 0<t< T. 

^ ’ ' i?Ge 


( 3 ) 


It is known that in the one-dimensional case (d = 1, A = /r = "d) the MLE 
^T,T = is consistent, asymptotically normal 


M ( 0,1 

and asymptotically efficient (see 0.III)- Here I {•&) is the Fisher information. 

Note that the construction of the MLE-process 'dt,T, 0 < t < T according 
to ([3]) and ([2]) is computationally difficult problem because we need to solve 
the family of equations ([2]) for all'd G 0 and ([3]) for all f G (0, T], 

We propose the following construction. First we study an estimatorof 
the method of moments and show that this estimator is \/T-consistent, i.e., 


E, 


Vf {Ot - 


< c. 


Then using this estimator -§rps obtained by the observations on the learning 
interval here | < <5 < 1, we introduce the one-step MLE-process 

W*) , <t<T. 

Here the empirical Fisher information matrix 

1 /■* 

li ("d) = - / m s) rh (-d, s)* ds —> I ("d), 

t JrpS 

as t ^ oo,T^ = o (t) and the vector score-function process is 

1 

At ("d, X*) = —p m{'d,s) [dX^ — m s) ds] . 
vt Jt^ 

Here and in the sequel dot means the derivation w.r.t. parameter 'd, rh 
is the vector-column of the derivatives rn\ {-d, t) and {-d, t). The estimator 
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is in some sense asymptotically efficient. In particular for = -dtp we 

have 

^{rrp-d)^U (0, I (^9)"^) , 

i.e., it is asymptotically equivalent to the MLE. Note that the calculation of 
the estimator for all t G [T^, T] requires the solution of the equation ([2]) 
for one value ■(9 = only. 


Recall as well the well-known properties of the Telegraph (stationary) 
process Y (t) ,t > 0. 


1. The stationary distribution of the process Y (t) is 


P^{Y{t)=y,} = 




A /i’ 


P^{k'(t) = 2/2} = 


A 


A -|- /i 


(4) 


2. Let us denote Pij (t) = P-d{Y (t) = yj\Y (0) = yi}, then solving the 
Kolmogorov equation we obtain 


Pn (t) = 




A 


+ Pi 2 (t) = 


A 


A 


3 —(A+/i)t 


X fj. X fx 

= _J^ _ 

A -|- /i A -|- /i 


P21 (t) = P22 (t) = 


\ y X y 


X -\- y X y 


(5) 


It follows from (jlj) and ([5]) that 

K (s) = E^K (f) Y{t + s) = 


yiy + 1 / 2 A 
X y 


here 


+ (92 - yif ,, = {Yf + Be-<'+'‘>-, 

(A + /i) 


T7 2/lh + l/2^ rY r ^2 A/i 

Y = E^)Y [t) = , , , D = {y2- yi) 


A -|- yU 


(A -h yY 


( 6 ) 


(7) 


3. Let PY C P be a family of a-algebras, induced by the events 

{Y {s) = yi,0 < s <t,i = 1, 2} . 

Then it follows from ([S]) that for some constant K > 0 and A < T and 
for all s > A, f > 0 the inequality 

[E^ {Y {s + T)Y{t + T) |Pj} - E^ [y (s) Y (f)] | < 

( 8 ) 

holds. 
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3 Method of moments estimator 


Let us consider the problem of the construction of \/T-consistent estimators 
of the parameter by the method of moments. Recall that we observe in 
continuous time the stochastic process 

dXt = Y (t) dt + dWt, Xo, 0<f<T, (9) 

here Wt,0 < t < T is a standard Wiener process, Xq is independent of 
< t < T initial value, Y (t) = Y {t,u) is stationary Markov process 
with two states yi and j /2 and inhnitesimal rate matrix 

-A A 

y -fi 

The processes Y (t) ,t > 0 and Wt,t > 0 are independent. 

We suppose for simplicity that T is an integer number. Introduce the 
condition 

A e [co,ci] ,/i e [co,ci] (10) 

here Cq and Ci are some positive constants. 

To introduce the estimators we need the following notations. 

• The function 

= ( 11 ) 

ry» rY>^ ' ' 

• The statistics 

1 

CT=yI]|A'.+i-A'y-l. (12) 

i=0 

• The random variable is dehned as a solution of the equation 

Cr = ’ 

here 

(Xt \ [ Xt\ 

vt= [y^~^I ■ 

• The event At that the equation f[T3|) has a solution. 


(13) 

(14) 
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• The random variable 


/3t = OlT ^Mt} + (co + Cl) 

The method of moments estimator is = {^t-, ■, here 


(15) 


At — 


y2 - yi 


V2-^ 

f^T] hr =- —f^T- 

2/2 - 2/1 


(16) 


The properties of these estimators are given in the following theorem. 

Theorem 1 Let the condition ffTOj) holds. Then for the estimators ffT6|) and 
some constante C > 0 we have for allT > 0 


E, 


Vf(XT-x 


<c, 


E, 


Vf {flT - /i) 


< c. 


(17) 


The proof is given in several steps. 

The next lemma gives \/T-consistent estimator for Y (see dB)- 

Lemma 1 Let the condition (na he fulfilled. Then the estimator Xt/T is 
uniformly consistent for Y and for any T > 0 




< 


C 

T’ 


(18) 


here the constant C* > 0 does not depend on h. 

Proof. Making use ([6]) we obtain the fllowing relations 


EU^-ri =E, 


ff [F (() - F] d( + 


J'2 


T 


E, 




[F (() - F] d( 

(2/2 - 2/i)') < ^ (^1 + ^ (2/2 - yif^ 


Corollary. The existence of the consistent estimator for and follows 
from (0]) and Lemma 1. Indeed, from the equality 


Y = 


X 


- 2/2 


T 


A T h A T /I 


2/1 
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and Lemma 1 we obtain 




Vr( 

1 

7 


V 1/2 - 1/1 

A + /i y 

'Vf{ 

^y^-T-^Xx 

/i y 

K y2-yi 

A + yU y _ 


<C, 


< a 


( 19 ) 


The statistics 


Xt _ 1 
~¥ ~T 


Y (t) dt + 


Wt 

~T~ 


is the sum of a bounded a.s. random variable and an independent of it 
gaussian random variable with parameters (0,T“^). Hence for rjT defined in 
fHTj) we can write the estimate 


E,c 


1 2 


VT(iit-D) <C, 


( 20 ) 


here the constant C > 0 does not depend on T and The constant D is 
defined in ([7]). 

Note that from the condition flTn]) we have 


A/i 


> 


(A + /i)^ dcf 

and we easily obtain the estimate (l2Up for the estimator 


r]T = max r]T, ^ 


Lemma 2 The following equality holds 

E^Cr = Y^ + (A + /i) 
and under the condition CO we have as well 


E, 


\/T {(fx — E^(^t) 


< C. 


( 21 ) 


( 22 ) 


(23) 


Proof. From stationarity of the process Y [t) and ([6]) we obtain 


1 ^1 


E^Ct = [Xi - Xo] - 1 = E^ / / Y{s)Y{t)dsdt + l-l 

Jo Jo 

= Y^ + 2Dd> (A + /i). (24) 
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Denote 


fi+l 


7* = 


Y (t) dt; AVT (i) = Wi+i - Wi. 


Further, from the equality 

T—1 T—1 T—1 

Cr - E,Ct = i 5^ (t? - E,7|) + i AH' (i) + i ^ ( A»' (■!)" - 1) 


T 

1=0 

follows the estimate 


i=0 


i=0 


'T-l 


12 . 


^T-l 


E, (Ct - E,Ct)" < ( 5^ (7? - E«7?) ) + pE, ( 7,AW' (*) 


J'2 

3 

rp2 


J=0 

fT-1 


. i =0 


E,, ^(AfF(^)'-l) := 3Ji + I 2 J 2 + 3 J 3 . (25) 


.1=0 


From stationarity of Y [t) we obtain 
/T-i ' 2 


T-lT-l 


•^1 = ^7 E, ( ( 7 ? - E,7?) ) ^0 (7 - E.7a (tJ - E,7|) 


'J'2' 

1 

rp2 


i=0 

^-1 r /•! /•! /•! /•! 


i=0 j=0 


Y 

i,j=0 



E^{Y(s)Y(t) 


0 Jo Jo Jo 


E,, [F (|z - j| + si) F (|z -j\+ ti) \Tl] } dsdMsidfi - (26) 

The estimate ([S]) allows to write 

\E^[Y{\z-j\+s,)Y{\z-j\+ t,) I ] - E,,F (si) F {t,) \ < K 
From this estimate, fl26|) and flTOj) we obtain 

T—1 

Ji < — V . 

^ — T2 / ^ — T 


J'2 

i,j=0 

The following estimates are evident 

2 


= (*)) = 4 F e„7? = < ^, 

\ i=0 J i=0 


/T-l 


T-l 


■h = pE, ( ^ [&W (if - 1 ] 


^ FEFAlEOf- 1 ]F p 


. j=0 


J2 


i=0 


The second proposition of the Lemma 2 follows from these estimates and 
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Lemma 3 The function ^ (x) (see dm); has the following properties 



(27) 

(28) 
(29) 


lim $ (x) = 0, 


(x) < 0, for X > 0. 


Proof. From (fT^ we obtain the representations 



which allow to verify the limits fl27p and 0281) and as well the estimate 029p 
for X < 2. For x > 2 this estimate follows from the explicit expression for 
this derivative 



Let ns consider the equation flTHp for ckt, here (t and rjT are dehned in 
flT^ and flT^ respectively. Due to Lemma [3] this equation has not more than 
one solution. Recall that At is the following event; the equation flT^ has 
solution and consider the statistics /3t dehned in (1151) (here Cq, Ci are the 
constants from the condition (llUp h 

Lemma 4 Under the condition cnii the estimate (^t is '/T-consistent for 
A + /i. Moreover, for some constant C > Q which does not depend on T and 
•d we have the property 


e4\/T(/33.-(A + a))1%C. 


(30) 


Proof. It follows from Lemmae [T] and |2] that 


Ct = Y^ + 2D4> (A + /r) + e, (T). 

Here and below we have for Si (T), z = 1, 2 ... the estimates 


( 31 ) 
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By Lemma m estimates (1^ and the boundedness of (x) we obtain 

as well the relation 

Ct = (A + /i) + £2 {T) ■ (32) 


If we have the event At then it follows from flT^ and fl3^ that 
2?77’$ (cTr) = 2fjT^ (A + /i) + £3 (T). 

This relation, Lemma [3] and the separation from zero by a positive constant 
of the estimator fjT (see Corollary to Lemma [T]) yield for uj G At 

$ {cxt) — (A + p.) = £4 (T). 

Therefore from Lemma [3] we obtain 

E,, (/?T - (A + /i))}' < a (33) 

If a; G A^ then the equation 

F2 + 2D$(A + /i)+73(T)= + 277T<h(x) (34) 

has no solution x G [2co, 2ci]. 

It follows from (1341) . Lemma [Hand the Corollary that the equation 

$ (x) = $ (A + /i) + £4 (T) 


has no solution for x G [2co, 2ci]. 

Hence we can write A^ C {|£4 (T)| > a} = Bt for some positive constant 
a which does not depend on T. This allow us to write 

P(^r) <P(St) = P{k4(T)| >«} <p||VT£4(T)p>a2T| 

C 


E,< 


< 


\/T£4 (T) 


a2T 


< 


T' 


This estimate and (133]) prove the Lemma jH 


Proof of the Theorem [T] The obtained results allow us to prove that 
the estimators defined in (1171) are \/T-consistent. Indeed, from the obvious 
equality 


XjT) 

Xt — A = Pt~ 


yi 


2/2 - yi 


— f^T 


+ 


A 


A + /i A + /r 


{(3t - (A + /i)) 
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and ffT^ we obtain by Lemma H] the estimate 


('Jt (\t ~ ^ 2 E ^ 


/ ^(T) 

Vfl3T 1 — 


A 


\ 2/2 - 1/1 A + /i 


A 


A + /i 


E, 


\/r(/?T - (A + A*)) 


< c. 


The second inequality in (1171) can be proved by the same way. 
Therefore the estimator is V^-consistent. 


4 One-step MLE 


Our goal is to construct the asymptotically efficient estimator-process of the 
parameter d = (A,/i) G 0. We do it in two steps. First we obtain by 
the observations 0 < f < on the learning interval 

the method of moments estimator = {\t^■, f^T^) studied in the preceding 
section. Here 5 G This estimator by Theorem [T]satisfies the condition; 


sup T'^E^ 

■d€K 


dj-s — -d 


2 




here the constant O > 0 does not depend on T and d G 0. Remind that 
0 = (co,ci) X (co,ci). Introduce the additional condition 


A4 (X). For some N > 2 


Cq 2N -|- 9 


(2/1 - 2/2)' 


> 


(35) 


Having this preliminary estimator we propose one-step MLE which is 
based on one modification of the score-function 

At (d, X') = ^ / m(d, s) [dX, - m(d, s)ds], <t<T 

Vt Jo 

as follows 


dt r = -|-t ^14(19^5) / m{dTSiS) dXs — , s)ds 

Jts L 


(36) 


12 
















Here the vector 


s) = {yi - 1 / 2 ) = (^1 “ ^ 2 ) ( 


f Ott {t, ■&) Ott {t, ■&) 


d'd 


\ 


d\ 


dfi 


and the empirical Fisher information matrix It('d) is 


= - m{'d,s)m{^,s)*ds —)■ 1(d) 


t 


Ij'S 


as t —)■ cx) by the law of large numbers. Here 1(d) is the Fisher information 
matrix 


1 (d) 


(di 


(97r(s,d)(97r(s,d)' 
1 / 2 ) Ei,- 


(9d 


d'd 


Let us change the variable t = tT ^ G [0,1] and introduce the random 
process d^ (r), < r < 1, here (r) = d*rp j, and ts = —)■ 0. 

Theorem 2 Suppose that d G 0, <5 G (|, 1) a'nd the condition M (2) holds, 
then the one-step MLE-process is consistent: for an'y z/ > 0 and any r G (0,1] 

P^o{I^tW-4|>^2}^0 (37) 

and it is asymptotically normal 

(d^ (r) - do) ^ AA (0,1(do)-') . (38) 


Proof. Let us denote the partial derivatives 
diT {t, d) 


tta (t, d) = 


. , dn {t, d) .. (9^7r {t, d) 

7r,{t,d) = ^^, 7rx,x{t,d)= ^ 


(9A ’ ^' (9/r ’ ^ ^ 9A2 ’ 

and so on. 

Lemma 5 Suppose that d G 0 and iV > 1. If the condition 


Co 


(di - 1 / 2 )' 


> 


7V + 1 


holds, then 


supE^o (Itta (t,d)|^ + \n^ (t,d)|^) < Ci, 
i?ee ^ 2 


(39) 


(40) 
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and if the condition 


(41) 


Co 2iV + 9 
{yi - y2f 4 

holds, then 

sup ( |7fA,A {t, ^9) 1^ + Itta,^ (t, + lift,,I, {t, 1^) < 6 * 2 - (42) 

Here the constants Ci > 0,C2 > 0 do not depend on t. 

Proof. For simplicity of exposition we write 

tta (t,^9) = tta, (t,t9) = 7r(t,t9)=7r. 

By the formal differentiation of 

dTT = [/i - (A + /i) TT - TT (1 - Tt) (|/i - I/2) (1/2 + idJl “ I/2) Tt)] dt 

+ TT (1 - tt) (yi - 1/2) dXt. ( 43 ) 

we obtain the equations 

dTTA = - TT dt - tta [A + + (1 - 27 r) {yi - 1/2) [1/2 + (l/i - I/2) tt] 

+ 7 r (1 - tt) {yi - y 2 f] dt + tta (1 - 27 r) {y^ - 1/2) dX (t) , ( 44 ) 

dTT/, = [1 - tt] dt - TT/, [A + /i + (1 - 27 r) {yi - 1/2) [1/2 + (yi - 1/2) tt] 


+7r (1 - tt) (yi - 1 / 2 )^] dt + TT/, (1 - 27r) {yi - 1 / 2 ) dX (t). (45) 

If we denote the true value of the parameters by -do and tt (t,-do) = 7r° etc., 

then these equations for d = -do become 

dTTA = - tt'’ dt - tta [Ao + /io + 7r° (1 - 7r°) {yi - 1 / 2 )^] dt 

+ TT^ (1 - 27r°) {yi - 1 / 2 ) dip (t), (46) 

d7r° = [1 - 7i°] dt - 7r° [Aq + /io + 7r° (1 - tt'’) (yi - 1 / 2 )^] dt 

+ tt" (1 - 27r) {yi - 1 / 2 ) dW (t), (47) 

here 

d7r° = [hq - (Ao + Ho) 7r°] dt + tt" (1 - tt") (yi - 1 / 2 ) dW (t). (48) 

This system of linear for tta and equations can be re-written as follows 

{xt = 7 r°, yt = Til, zt = tt", a = Ao -f /io, & = di - 1/2) 

dxt = [/io - aa;*] dt -h bxt (1 - Xt) dWt, (49) 

dyt = -xtdt - [a -h b'^xt (1 - xt)] ytdt + b{l- 2xt) yt dWt, (50) 

dzt = [1 - Xt] dt - [a + b^xt (1 - Xt)] Ztdt + b{l- 2xt) Zt dWt. (51) 
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Note that as Aq > 0 and juq > 0 the process 7r(t,-do) = Xt E (0,1) is 
ergodic with two reflecting borders 0 and 1. Therefore the process 7i° 
is ergodic with the invariant density 




2 (Mn-^n) n 

—-TT ^ 


a; (1 — x) ' 1 

2/io + 2 (Ao — /io) X 

a (A) 1 

. ivi - y 2 f X {1 - x) 

b (1 - a;)]N/^o-Ao )-2 

f 7do 7-^0 1 

G(4) 

X 1 — a; J 


here we denoted 7 = 2 (yi — 1 / 2 ) ^ and G (-do) is the normalizing constant 


G'(do) 





7do 

X 


7-^0 ) 
1 — x] 


dx. 


The processes yt and Zt have explicite expressions 


yt = 


exp 


Zt = / exp 


r-i r 

a + b'^Xs (1 - Xs) - — {I - 2xsf 


d-b J (1 — ^Xs) dWs j x^ dn, 

ft r ^2 

a + b'^Xs (1 - Xs) - — {I - 2xsf 

+b [ (l-2a;,)dlT,l[l — Xy] dn. 


ds 


ds 


Let us put yis = \ — Xs- Then we have 


(52) 


(53) 


X, (1 - I.) - 1 (1 - ix.f = -3x= + i 


and 


yt= f (x„-^)e”(“+^)^*”^^exp|362 I x^ds + 26 f x,dlT,| dn. 


To estimate the moments \yt\^ we note that |x„ 
Holder inequality 


< I and use the 


j \f {v)g{v)\dvj < (j |/(n)|“dnj J |^(n)|^dn 
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with / (v) = exp {—a {t — v) e} and 


g (v) = exp — ( a (1 — e) + — ) (t — n) +36^ / x^ds + 2b I x^dhh^ [>, 


here e > 0. This yields the estimate 


Jo 

here the constant C {N, e) > 0 does not depend on t. Further, we can write 


exp < 3Nb / x^ds + 2Nb / x^dhFs 


= ( exp <1 2Nb I x,dPF« - 2N%'^ I x^ds 


exp Nb'^ {2N + 3) / x^ds 


< exp 


|^(2iV + 3) {t-v) 


because x^ < 1/4 and 


E^q exp 


2Nb 


x.dlF, 


2N^b 


2u2 



1 . 


Therefore, 


E, 


'&0 


\y,f <C(N,e) 


dn 


C(N,e) /'h-A'{«C-<)-4(''+i))»-»>d„. 


We see that if 


-^0 + ho y ^ \ ^ 

{yi - y 2 f 2 2 ’ 


then E,jg \yt\^ ^C. In particular, if in the condition (13^ we put N = 2 and 
choose sufficiently small £ > 0, then we obtain the estimate 


sup E,jo 

)9oG0 


avr it,'do) 

dX 


<C, 


(54) 
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here the constant C > 0 does not depend on t. 

We need as well to estimate the derivatives flUl) . fHS]) for the values '& 7 ^ 'do- 
The equation for tta becomes 

dTTA = - vr df - tta [A + /i + (1 - 27r) {y^ - 1 / 2 )^ (tt - 7 r°) 

+7r (1 - tt) {yi - 2 / 2 )^] df + tta (1 - 27r) (j/i - j/ 2 ) dW (t). (55) 

Hence if we put a = X + y,yt = ttx and h = yi — y 2 , then we obtain the 
equation 

dyt = -xtdt - [a + 6^ (1 - 2xt) {xt - x^) + b‘^Xt (1 - Xt)] ytdt 
+ b{l- 2xt)ytdWt. 

The solution of this equation can be written explicitly like fl52|) but with 
additional term 6 ^ (1 — 2xt) {xt — x^) in the exponent. This term satishes 
the inequality 

(1 - 2xt) {xt - x°) > - 1 . 

Hence if we repeat the evaluation of the \yt\^ as it was done above, then 
for it boundness we obtain the condition 

X + y 3 N 

( 2/1 - 2 / 2 )^ 2 2 


For the second derivative vr = {t, i9) we obtain the similar estimates 

of the moments as follows. The equation for tt is 

dvr = -yt [2 - 2b^yt (xt - x°) + 2b%t (1 - 2 xt)] dt - 2by‘f dWt 

- TT [a + 6^ (1 - 2xt) [xt - x°) + b'^xt (1 - x*)] dt + fevr (1 - 2xt) dWt. 
Let us write it as 


dvr = H (t) dt + B (t) dWt — vr [a + (F (t)] df + ittD (t) dWt 

in obvious notations. Hence the solution of it is 
d^TT{t,^) 


dX^ 


e-/.‘KcW-iD(.)"ld.+/; o(.)d». (.J,) ^ g ^ 


We have the corresponding estimate 

C (s) - ^ =b^(l- 2xs) {xs - x°) + ^ [2xs (1 - Xs) - (1 - 2xsf] 


> —r — 3r ( X — 


b^ 


352 


+ — =-:-35 X — 


> -- 


352 
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because {x — \ 



Therefore, 



2 Ar + 3 

4 



and if 


\ + jji 9 N 

iyi - y2f 4 ^ 2 ’ 


then we obtain 




dX^ 


N 

< c. 


Hence under condition flTTl) we have 


sup 


d^7r{t,^) 

dX^ 


N 

< c. 


The similar estimates can be obtained for the other derivatives. Lemma [5] is 
proven. 

Lemma 6 The solutions {xt,yt,Zt) of the equations fH9|) - fl5T|) have ergodic 
properties. In particular, we have the following mean square convergence 


1 

T 



T 


tua {t^'dof dt 


rfix {t, 'do) rhf, {t, ■do) dt 


T 


rn^{t,dof dt 


T 


yl dt 


7F / ytzt dt 


T 


Zt dt 


Ill ('do) ; 

Ii2 ('do), 
I22 ('do), 


Proof. For the proof of the invariant measure existence see [3], section 4.2. 
Note that the equations flT9|) - fl5T]) do not coincide with that of |3], because 
there it is supposed that A = /i and yi = 1 , 1/2 = 0, but the arguments given 
there are directly applied to the system of equations fl49|) - (l5T]) too. 

Recall that the strong mixing coefficient a {t) for ergodic diffusion process 
fl43|) satisfies the estimate 


a{t) < e-d*i 
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For the proof see Theorem in |15) . To check the conditions of this theorem 
we change the variables in the eqnation 


r dn 

^t = g{xt), g{x)= ——-xe(0,1) 

Ji /2 bv (1 - v) 

and obtain the stochastic differential equation 

d^t = + dWt, ^o = g{xo), 0<t<T. 

The process > 0 has ergodic properties and the drift coefficient A (•) 
satishes the conditions of this theorem. 

Now to verify the convergence 



we can apply the result of the following lemma. 

Lemma 7 Let {Yt,t >0} be a stochastic process with zero mean and for 
some m > 2 and k > 1 


E \y,\ 


m{2k—l) 


<^ 1 , 


dt<C2, 


here a {t) is the strong mixing the coefficient. Then 


E 


Ytdt 


2k 


< C 3 T\ 


Proof. For proof see Lemma 2.1 in [9]. 

Therefore if we put Yy = m = 3 and k = 1, then we obtain 

the convergence fl56|) . 


Let us verify the consistency of the one-step ML E-pro cess. We can write 

{\^T (r) - < P^o {[4^ - ^0 

' m{'dTS,s) dXs — m{idTS, s)ds 


V 

^ 2 




tT 


f 'j'S 





> - 
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For the first probability by the Theorem [T] we have 


1?0 


— -do 




— '(9o 


< 


C 


Z/2T'5 


^ 0 . 


The second probability can be evalnated as follows 


1?0 


< Pl ?0 


TT JrpS 


7 h{i!}rps, s) dXs — m{dTSi s)ds 


-l 

-1 


> - 






TT JrpS 


7h{dTS-, s) dhFs, 


> 


z/ 


tT 


/ 'j'S 


m{Jrps , s) Am ( JrpS , s) ds 


\ 


j ds 



here Am sj = m('do, s) — m{Jrps^ s). We can write 


tT Jps 

IrT0Ts)-^ 

< - 


m{-&rps, s) dlW 


T'y 


stT 


^(5—7 


I jj'5 


m{Jrps^ s) dhFs 


here 7 is such that 5 — 7 > ^. Hence 

IrA^Ts)-^ r 




rT 

- ^’^0 1^5-7 
+ P^o 


/ 'j'S 


m{Jrps^ s) dhFs 


> 


r-rT 


/ 'j'S 


rh('dr 5 , s) dWs 


> 


IrTi'^T^) 


-1 


U 


T'y 


>T>^o, 


as T —)■ cx), because 


■^0 1 j^s-rf 

< 


stT 


f 


m{Jrps^ s) dWs 


> 


1 


p,J^25-2y 


E, 


'l^o 


f 7^5 


rh('d'r 7 


ds < 


C 


JjT^5-2rf-l 


0 . 
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Recall that 25 — 27 — 1 > 0. Further 


'&0 


tT 


1 ptT 



>il 

- / rh('i97’«,s) 

1 

'T' 

0 

ds 

Jt^ 



V 


< 


rT^-y 


prT pi 

I m{'&TS,s) / s)*dvds —'do 

Jo 


> 


^0 


T'y 


> 


V 


0 


as T —)■ 00 , because = O {T and other terms are bounded 

probability. Here = "do + — 'Jo)- 

To prove fl38|) we write 


in 


AtT (r) - 190 ) = ^'dr'5 - '*^0 j + 




/ 'j'S 


\/tT Jt^ 
We have the estimate 


mi^rps^s) m{'Jo, s) — s) 


rfiAj-s, s) dWs 


ds. 



1 




yj tT Jt^ 

TfiAj-s, s) — m{'Jo, s) 

dM/. 


ctT 


< 


E 


1?0 


7fi{-Jrps, s) — rh('do, s) 


ds —)■ 0 


TT JrpS 

as T —)■ 00 , and by the central limit theorem the convergence in distribution 


-tT 


m(i9o,s) dW, ^AA(0,I(i9o)). 


VtT Jt^ 

Further, let us denote Vj-s = VtT — ’Jq^ , then we can write 


"Vj-s + — f m{'Jrps,s) mAo, s) — 1 A 

y/rT Jt^ 


1 ptT 


ds 


= IrT^Ts) |^IIrr(^9r«) ~ J J ^ m(i9r«,s)rh('d^,s)*dr ds ) Vts, 
here ■dr = 'Jt^ + r ( 'Jt^ — 'Jo)- The presentation 


771 ( 19 ^, s) = s) + J m{'Jq, s)dq {^'Jt^ —'Jo 
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and the equality 


I . 

/ m{'i}rps, s)m{'i}rps, s)*ds 

TT Jj-5 

allows us to write 


Vj'S 


y/rT 

= \/ tT 


-1 r-rT 


f 'J '5 


m{-do, s) — m{-dTS, s) 


ds 


- '^Q 


0 ( 1 ) =T^-^ 0 ( 1 ) 


0 , 


as T —)■ cx), 

Let us verify that the Fisher information matrix is non degenerate. It is 
sufficient to show that the matrix 


J(^o) 


{E^^ytZt, j ) 


is non degenerated. Here yt, Zt are stationary solutions of (150]) and CB 
respectively. If this matrix is degenerated, then 


— i^&o'ytZt)'^ . (57) 

Recall that by Cauchy-Schwarz inequality 


(Ei^Q^/t^t) ^ E,Jp|/^ 'E^gZ-j- 


with equality if and only if Zt = cyt with some constant c 7 ^ 0. Therefore in 
the case of equality we have [cyt — Ztf' = 0 . 

Introduce a new process Vt = cyt — as a solution of the equation 
dhi = [xt (1 - c) - 1 ] df - [a + b'^Xt (1 - Xt)] Vtdt + 6 ( 1 - 2 xt) Vt dWt, 

here Vt and Xt are stationary solutions. 

Further, following |3j. Section 4, here the similar estimate was obtained, 
we write this solution as 

Vt = hoe““* + f [7^ (1 ~ c) — 1] ds — 6^ f (1 — Xg) ds 

Jo Jo 

+ b [ (1 - 2£,) h, dlH,. 

Jo 
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Hence 




[x, (1 - c) - 1] ds^ < 4 (1 + e-2“*) 
® Jo -Lo Jo 


with some constant C > 0 which does not depend on t. Recall that 
does not depend on t too because Vt is stationary solution. Therefore if we 
show that for all c 


lim E^q 

t—>-oo 



c) — 1] ds 


> 0 , 


then the matrix J (■do) is non degenerate. The random process 


Ct= e [x^ (1 - c) - 1] ds 


is the solution of the equation 

dG 


df 


=-aCt + Xi (1 - c) - 1, Co = 0. 


The elementary calculations show that for all 'do and c 

> 0 , 


lim E^oCt = 

r—>-cxD 


2 [tto (1 - c) - 1]^ 


here ttq is the stationary distribution. 

Therefore the Fisher information matrix I ('do) is non degenerate for all 
■do G 0. 

Note that the limit covariance matrix of the one-step MLE-process by the 
Theorem [2] coincides with the covariance of the asymptotically efficient MLE 
[3], therefore "d^ (r) is asymptotically efficient too. 


5 Discussions 

The learning interval in one-step section is [0,T^], where 6 G (^, 1), i.e., it is 
negligeable with respect to the whole observations time T. It can be done 
even shorter, if we use two-step MLE-process approach, as it was proposed 
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in [13]. It corresponds to the learning interval [0,T^) with 6 G (|,|]. The 
procedure is the follows. First we obtain the preliminary estimator as 
before. Then we introduce the second preliminary estimator 

t e [t\t] 

and then we dehne two-step MLE-process 

= Kt + X*), te [T^ T] , 

here 

1 /"* 

At X*) =—= m s) [dXs - m (^92, s) ds], te [T^,T] . 

V t Jt^ 

It can be shown that for all r G (0,1] and t = tT we have the asymptotic 
normality of the estimator (r) = 

VXT (^9- (r) - ^9o) ^ AA (O, I (4)“') • 

See the details in [T3] . 

Note that it can be shown that the one-step MLE-process converges in 
distribution to the limit Brownian motion. Let us introduce the random 
process 

rjT (r) = r\/TI('(9o)"^/^ (^9^ (r) - ^9o), < r < 1, 

here —)■ 0. More detailed analysis shows that the random process 

(r), r* < r < 1 converges to two-dimensional standard Wiener process 
IT (r), r* < r < 1 with any r* G (0,1]. For the details see the proof of such 
convergence in similar problem in |l3j . 
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